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Abstract 

The most abundant of the modified nucleosides, and once considered as the "fifth" nucleotide in RNA, is pseudouridine, 
which results from the action of pseudouridine synthases. Recently, the mammalian pseudouridine synthase 1 (hPusI p) has 
been reported to modulate class I and class II nuclear receptor responses through its ability to modify the Steroid receptor 
RNA Activator (SRA). These findings highlight a new level of regulation in nuclear receptor (NR)-mediated transcriptional 
responses. We have characterised the RNA association and activity of the human Puslp enzyme with its unusual SRA 
substrate. We validate that the minimal RNA fragment within SRA, named H7, is necessary for both the association and 
modification by hPuslp. Furthermore, we have determined the crystal structure of the catalytic domain of hPusIp at 2.0 A 
resolution, alone and in a complex with several molecules present during crystallisation. This model shows an extended C- 
terminal helix specifically found in the eukaryotic protein, which may prevent the enzyme from forming a homodimer, both 
in the crystal lattice and in solution. Our biochemical and structural data help to understand the hPuslp active site 
architecture, and detail its particular requirements with regard to one of its nuclear substrates, the non-coding RNA SRA. 
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Introduction 

Pseudouridine is a modified uridine known to be essential for 
the function of most classes of non-coding RNAs (ncRNA) such as 
tRNAs, rRNAs, snoRNAs, or snRNAs [1,2]. Pseudouridines are 
present in RNA from bacteria to mammals, and their synthesis is 
due to a protein family named pseudouridine synthases (PUS; [3]). 
Pseudouridine synthases are divided into six distinct families: 
TruA, TruB, TruD, RluA, RsuA, and Pus 10, with the last one 
being present only in archaea and eukaryotes [4,5] . Atomic models 
for various members of these families have been solved and show a 
conserved catalytic core despite very low sequence homology 
between them [5,6,7,8,9]. In addition, several secondary structure 
elements or entire domains are found around the structurally 
conserved core of particular members [5,7,10]. Humans have a 
variety of pseudouridine synthases, which act on diverse classes of 
ncRNAs. One of the first identified was the pseudouridine 
synthase 1 (hPuslp), which is a member of the TruA family 
despite their low sequence similarity (<20%;[11]). The hPuslp 
enzyme was identified in the late 1990's on the basis of its 
sequence similarity with the homologous yeast enzyme. The 
eukaryotic Pusl enzyme must localise in the mitochondria, the 
cytoplasm, and the nucleus based on the location of its identified 



substrates or partners [12,13,14]. More recently, the enzyme was 
shown to co-localise with particular nuclear receptors in the 
nucleus [12,15,16]. 

Although members within the PUS family do not exhibit 
extensive sequence homology, they share an enzymatic domain 
that presents a high degree of structural similarity [11]. The active 
site is located in between the two lobes of the catalytic core 
[6,17,18,19,20]. PUS enzymes are highly specific, capable of 
recognising their target uridine when embedded in a particular 
structural context, avoiding random uridine modification within 
RNA molecules. The hPuslp enzyme is no exception, although it 
appears to have a more relaxed sequence specificity compared to 
other pseudouridine synthases [2 1] . The TruA family is the most 
divergent compared to the other families [4]. The major sites of 
modification by the eukaryotic Puslp enzyme are positions 27, 28, 
34, and 36 within tRNAs [22,23]. In addition, yeast Pusl has been 
shown to modify U2 snRNA [24]. A few years ago, the Steroid 
receptor RNA Activator, a ncRNA emanating from the SRA gene, 
was characterised as a target of Puslp [12,15,16]. Multiple sites 
within the SRA were shown to be subject to pseudouridine 
modification, although only U206 within the H7 element was 
identified unambiguously [15]. Lasdy, the hPuslp enzyme is 
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involved in the metabolic syndrome causing mitochondrial 
myopathy and sideroblastic anemia (MLASA; [25]). 

We have characterised the catalytic domain of the hPuslp 
protein biochemically and structurally. A truncated protein has 
significant levels of activity towards a target tRNA and on the 
specific H7 element from the SRA, when compared to the full- 
length hPuslp enzyme. We also measured the affinity of the 
truncated form of hPuslp (AhPuslp) for various H7 SRA 
substrates, which correlates with the observed activities. We 
determined the structure of the catalytic domain of AhPuslp and 
the D146A mutant of this enzyme. We observe several molecules 
in the active site although their significance for the pseudouridine 
reaction remains unclear. Despite the low sequence-dependence 
shown for pseudouridylation by hPuslp, we suggest that the 
secondary structure of the substrate RNA is important for hPuslp- 
mediated activity. The atomic model of AhPuslp further 
demonstrates why the hPuslp enzyme binds to its RNA substrate 
as a monomer. The C-terminal end of hPuslp folds into two ot- 
helices and these helices pack against the back of the hPuslp 
molecule, which prevents the dimerization observed in the 
bacterial homologue, TruA [6]. We propose that this C-terminal 
region is responsible for the atypical RNA substrate binding and 
activity of hPuslp. 

Materials and Methods 

Expression and purification 

The fragment of hPuslp encompassing residues 83-394 
(referred to as AhPuslp hereafter) was amplified by PGR using 
appropriate oligonucleotides from the hPuslp-containing plasmid 
obtained previously [22]. We subcloned this fragment, using 
restriction sites Pcil and Xhol, into a modified pET vector for 
subsequent over-expression in Escherichia coli. The protein was 
expressed in E. coli BL21 (DE3) Star with an N-terminal 
9Xhistidine tag for purification. Briefly, cells were grown in TB 
medium at 37°C, and subsequently incubated overnight at 18°C 
with 0.1 mil isopropyl thio-(3-D-galactoside (IPTG). Cells were 
harvested by centrifugation at 4000 xg, and resuspended in 
20 mM HEPES (pH 7.0) containing 500 mM NaCl, 5% glycerol, 
30 mM imidazole, 2 mM CHAPS, 1 mM DTT, DNase 1 (1 ug/ 
ml), lysosyme (1 ug/ml), and a cocktail of protease inhibitors 
(phenylmethylsulphonyl fluoride 1 mM, leupeptin 2 ug/ml and 
pepstatin 2 ug/ml). Cells were lysed using an EmulsiFlex C3 
(Avestin) homogenizer, and were clarified by centrifugation at 
30,000 xg for 30 minutes at 4°C. The protein was purified with an 
affinity chromatography step on a nickel-chelating resin (HisTrap, 
GE Healthcare), and eluted in the same buffer supplemented with 
500 mM imidazole. The tag was cleaved with Tobacco Etch Virus 
(TEV) protease for 10 hours at 4°C. The cleaved sample was 
reloaded to the affinity column to eliminate the uncleaved protein, 
the tag, the protease, and remaining contaminants. The protein 
was applied to a gel filtration Superdex S75 16/60 column (GE 
Healthcare) pre-equilibrated with 20 mM HEPES (pH 7.0) 
containing 150 mM NaCl, 5 mM MgCl 2 and 5 mM TCEP. 
The purity and homogeneity of the protein were assessed by SDS- 
PAGE. Full length (FL) hPuslp was expressed and purified in the 
same manner. 

Site directed mutagenesis 

The D146A mutation was introduced to AhPuslp using the 
Quick Change mutagenesis kit (Stratagene) according to manu- 
facturer instructions. The oligonucleotides used were: 

5'CAGCGCTGCGGCCGGACAGCCAAGGGTGTGTCC- 
GCAGCC and its reverse complement. The mutation was 



confirmed by sequence analysis. The D146A mutant version of 
AhPuslp was purified by methods described for AhPuslp. 

Crystallisation and structure determination 

For crystallisation assays, proteins were concentrated to 7 mg/ 
ml, and used for crystallisation after 2 days. We used the sitting 
drop technique for crystallisation, and mixed the protein with 
the reservoir solution in a 1:1 ratio. Crystals of AhPuslp 
appeared after 2-3 days in a buffer containing 30% PEG 4000/ 
glycerol, 0.1 M Bicine/TRIS (pH 8.5), 0.02 M amino acids 
mixture (0.02 M sodium L-glutamate, 0.02 M DL-alanine, 
0.02 M glycine, 0.02 M DL-lysine HC1, 0.02 M DL-serine) at 
4°C. The D146A AhPuslp mutant crystallised in the following 
buffer: 20% PEG 8000, 0.1 M TRIS (pH 8.5) at 4°C. Optimized 
crystallisation conditions are 28% PEG 3350/glycerol, 0.1 M 
Bicine/TRIS (pH 8.5), and 10% amino acids mixture for the 
AhPuslp and 17% PEG 8000, 0.1 M HEPES (pH 8.0) for the 
D146A mutant. 

Crystals were flash cooled in liquid nitrogen after cryo- 
protection with the reservoir solutions enriched with 10% PEG 
400 and 5% glycerol. Data collection from single crystals was 
performed at ID23-2 at the ESRF (Grenoble, France), and PXIII 
at the SLS (Villingen, Switzerland). About 20 data sets were 
collected from single crystals and co-crystals, with only slight 
variation in the cell parameters. Datasets were integrated and 
scaled using XDS [26]. Generally, crystals belong to space group 
P2i2]2i or P22i 2 1 with two or one monomer(s) per asymmetric 
unit, respectively. Datasets suffered from anisotropy and pseudo- 
translational symmetry, which initially prevented us from solving 
the structure by molecular replacement. We finally solved the 
structure with PHASER [27] using Puslp as a model (pdb id: 
4ITS; [28]). 

Alternative cycles of maximum likelihood refinement were 
performed with PHENIX [29], and model fitting was carried 
out with COOT [30]. Structures of AhPuslp and the AhPuslp 
D146A mutant were refined to 2.0 A and 2.7 A resolution, 
respectively. Crystallographic data are summarized in Table 1. 
Strong positive electron density peaks were observed next to the 
catalytic residue D146 in AhPuslp or the A146 residue in the 
mutated enzyme. Omit maps, calculated with only the protein 
models, were used to fit putative ligands found in the 
crystallisation condition (see Figure S5). Anisotropic scaling, 
bulk solvent correction, and TLS (Translation/Libration/ 
Screw) restraints were used throughout the refinement proce- 
dure. Solvent molecules were added last in the unassigned peaks 
of the F () -F c Fourier electron density map. The models show 
good stereochemistry with no residues in the disallowed region 
of the Ramachandran plot. The accession number for the 
coordinates of the structures of AhPuslp and D146A AhPuslp 
have been deposited with PDB codes 4NZ6 and 4NZ7, 
respectively. 

Fluorescence anisotropy measurements 

Several H7 SRA oligoribonucleotides were ordered with a 
fluoro-uridine at the U206 equivalent position and a fluorescein 
probe at the 5' -end for the fluorescence anisotropy measurements 
(Dharmacon). Although we used a 5-fluorouridine (5-FU)- 
containing RNA, our experimental conditions did not generate a 
significant amount of covalently bound protein/RNA complexes 
(Figure SI). Steady-state fluorescence anisotropy measurements 
were performed in a 50 uE high precision fluorescence quartz cell 
(HellmaAnalytics, light pass 3x3 mm) with a Jobin-Yvon Horiba 
Fluoromax-4 spectrophotometer equipped with emission and 
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Table 1. Data col 


ection and refinement statistics. 








AhPusIp 


AhPusIp D146A 


Beamline 


X06DA SLS 


ID 23-2 ESRF microfocus 


Data processing 


Resolution (A) 


50-2.0 (2.05-2.00) 


50-2.7 (2.77-2.70) 


Crystal space group 


P2,2,2, 


P22,2, 


Cell parameters (A) 


a = 71.42; b = 75.12; c= 110.58 


a = 39.45; b = 69.04; c = 1 1 6.78 


Unique reflections 


40510 (2984) 


9281 (686) 


Rmeas (%)" 


8.5 (55.8) 


14.9 (71.9) 


Mean redundancy 


4.3 (4.2) 


5.0 (5.1) 


CC/2 (%) 


99.8 (82.9) 


99.6 (88.5) 


Completeness (%) 


99.3 (98.8) 


99.8 (100) 


Mean l/a (%) 


16.4 (2.5) 


13.8 (3.4) 


Refinement 


Resolution (A) 


30-2.0 


20-2.7 


rmsd bond (A) 


0.002 


0.005 


rmsd angle (°) 


0.633 


0.863 


Rcrys, (%) b 


18.40 


18.33 


Rfrae (%)' 


22.41 


23.81 


Number of atoms: 


protein 


4949 


2419 


Bound compounds 


20 


15 


PEG 


14 


10 


water 


479 


110 


Average B factors: 


protein (A 2 ) 


28.4 


17.1 


Bound compounds (A 2 ) 


56.9 (Lys), 42.9 (Glu) 


20.9 (HEPES) 


solvent (A 2 ) 


49.8 


22.4 


water (A 2 ) 


36.8 


15.2 


Ramachandran 


allowed {%) 


97.6 


95.6 


favoured (%) 


2.4 


4.4 



Values in parentheses are for highest-resolution shell. 

Xeas = Zhki [N/(N-1)] 1/2 £ |l(hkl)-<l(hkl)> |/£ hld l(hkl), where N is the multiplicity of a given reflection. 
"(Tryst — Elfobsl - |Fcaicl|/ZI F obsl for all reflections. 
c fif ree was calculated on the 5% of data excluded from refinement. 
doi:1 0.1 371 /journal.pone.009461 0.tOOl 



excitation polariser filters. The samples were kept at 20°C with a 
VWR digital thermostat bath. 

Anisotropy titrations were carried out by adding increasing 
amounts of FL hPuslp (or AhPusIp, or D146A AhPusIp) to a 
fixed concentration of Fluoroscein-RNA in 10 mM HEPES 
(pH 7.0) containing 150 mM NaCl, 5 mM MgCl 2 and 5 mM 
TCEP. Binding constants were determined for a constant SRA 
concentration fixed at 1 |J.M. As Fluorescein was the fluorescent 
marker for these measurements, the excitation wavelength was 
adjusted to 490 nm, and the emission wavelength was fixed at 
516 nm. The Scatchard equation was rewritten to fit the 
anisotropy, r, as follows: 



r=ra + 

/ (l + (l/^)((l/«)S, + J',))- v /(l + (l/^)((l/ n )S,+f,)) 2 -4(l/ n )(l/<: J ) 2 S,f; 



where P t and S t designate the total concentration of enzyme and 
Fluoroscein-SRA, respectively; r f represents the anisotropy at the 
plateau when all the enzyme is bound; and r 0 and r correspond to 
the anisotropy values of Fluoroscein-SRA in the absence and in 
the presence of a given concentration of enzyme, respectively. 
Values k d and n correspond to the apparent dissociation constant 
and the number of binding sites of the enzyme for the SRA 
fragment, respectively. Titration curves were fitted with Microcal 
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Origin 6. 1 software based on the non-linear, least-squares method 
and the Levenberg-Marquardt algorithm. Each anisotropy data 
point has been measured four times in two independent 
experiments. 

RNA synthesis and pseudouridine formation assay 

RNA substrates were synthesized using a sense T7 DNA 
oligonucleotide (5'TAATACGACTCACTATAG), an antisense 
template oligonucleotide (see below), and a high concentration of 
T7 RNA Polymerase, in the presence of 25 uCi [5- 3 H]-UTP 
(23.9 Ci/mmol, Moravek) with 100 uM cold UTP and 1.0 mM 
non-labeled ATP, GTP, and CTP as previously described 
(Milligan and Uhlenbeck, 1989). The antisense template oligo- 
deoxynucleotides used for RNA synthesis were: mouse mitochon- 
drial tRNA Asp (with mutations T31C, T32C, and A39G), 
5'TAAGATATATAGATTATTGATCTATAATTTAACCTT- 
GACAGGGTTATGTAATTGATTTTACTAATATGTTCTA- 
TAGTGAGTCGTATTA; full-length H7 SRA, 5'CCCA- 
CAGGTGGGGACCTGGGAGCCTTACTTGAAGGAGGTG- 
GAGGC C CC ATTGGGGCTATAGTGAGTC GTATTA; trun- 
cationl of H7 SRA, 5'CCCACAGGTGGGGACCTCGAAA- 
GGCCCCATTGGGGCTATAGTGAGTCGTATTA; trunca- 
tion 2 of H7 SRA,5 ' C C C AC AGGTGGGGC GAAC CCC ATT- 
GGGGCTATAGTGAGTC GTATTA; truncation 3 of H7 SRA, 
5 ' G AC AGGTGGGGC GAAC C C ATTGGCTATAGTGAGTC - 
GTATTA; truncation 4 of H7 SRA, 5'CCCACAGG- 
TATTGGGGCTATAGTGAGTCGTATTA. 

The H7 SRA substrates were heated in water to 95°C for 2 
minutes, allowed to cool slowly to 25°C (1 hour 15 minutes, 
ambient temperature 22-23°C), and then held at 25°C for 5 
minutes. Substrate reactions (100 JLLl) with human pseudouridine 
synthases or fi-galactosidase (3.2 ug protein per assay) were 
50 mM ammonium chloride, 5 mM DTT, 25 mM Tris (pH 7.5) 
and 5 mM MgCl 2 , with the annealed RNA added to the other 
components at 25°C. Three separate assays were performed for 
each protein and RNA substrate combination. Incubations were 
held at 25°C for 4 hours, and then mixed with 500 |il Norit A 
charcoal (15% (w/v) in 0.2 N HC1), mixed vigorously, and then 
incubated at 37°C for 30 minutes with intermittent shaking. The 
charcoal was pelleted (1 6,000 xg, 1.5 min) and the supernatant 
was filtered through a SPIN-X filter unit (Costar, Cambridge, MA) 
for 2 min at 8,000 xg. The filtrate was mixed with 10 ml of 
scintillation fluid, and counted for 10 minutes [31,32]. 

Results and Discussion 

The hPuslp enzyme was shown to modify the non-coding SRA 
in a complex post-transcriptional mechanism, which regulates 
nuclear receptor-induced transactivation [12]. Recently the 
secondary structure of the SRA was determined [33]. The 
previously described SRA substructure STR5 was redefined and 
named H7 [12,33]. Moreover, it was shown to harbour a slightly 
different structural organisation compared with the initial predic- 
tion [33]. In particular, the loop containing the targeted U 
(position 236 according to the 5'-end used in [33]) is asymmetric. 
We have used the latest H7 structural organisation to design 
various H7 substructures to investigate hPuslp/SRA interactions 
(Figure 1A). 

FL and AhPusI p enzymes are able to bind SRA substrates 

First, we determined the binding stoichiometry and the binding 
affinities for three different hPuslp constructs using the fluores- 
cein-labelled H7 RNAs (Figure IB). We have used the human 
full-length enzyme (FL hPuslp), AhPuslp or AhPuslp D146A. 



The D146A mutated protein was further shown to behave as the 
WT enzyme in solution (Figure SI). FL hPuslp and the truncated 
forms of hPuslp bind to the various RNA constructs with a 
stoichiometry between 0.7 and 1 (Table 2) indicating that human 
Puslp binds the H7 SRA as a monomer. This was also consistent 
with multiple angle light scattering measurements performed on 
AhPuslp (data not shown). This result was somewhat surprising 
since the bacterial TruA enzyme was shown to function as a dimer 
and crystallised as a dimer [6] . This also opens the question of how 
hPuslp binds to its substrate RNAs (tRNA or another class of 
RNA). It was shown previously that both molecules of the TruA 
dimer contact the tRNA substrate [34]. It therefore suggests that 
eukaryotic Puslp enzymes will not recognise the elbow region of 
their tRNA substrates since these enzymes bind as a monomer, in 
contrast to the bacterial TruA enzymes [34]. Moreover, as 
observed for the human enzyme, yeast Puslp is also active as a 
monomer [35]. The fact that the hPuslp enzyme acts as a 
monomer reflects the general properties of PUS enzymes from the 
other families, which were also shown to function as monomers 
[5,7,8,9,10,36]. 

The measured binding affinities range from 100 to 500 nM 
indicating that hPuslp recognises the H7 SRA sequence with an 
affinity similar to its reported affinity for tRNAs (Table 2; Figure 
S2; [21]). Furthermore, we observe that the D146A mutation leads 
to a 10-fold decrease in the affinity of the enzyme for the full- 
length H7 substrate (K d of 2.2 |JM instead of 0.2 (xM seen with the 
WT enzyme) clearly indicating that removing the catalytic 
aspartate destabilises the complex formation. Previous studies 
have stressed the importance of the catalytic D146 for the 
enzymatic activity of hPuslp [12,37]. Furthermore, the K d value 
of the mutant enzyme is comparable to the Kj measured for PUS 
enzymes from other subfamilies carrying the same homologous 
mutation at the active site (Table 2; [38]). It therefore indicates 
that the absence of the aspartate significantly destabilises complex 
formation, which is surprising since the proposed enzymatic role of 
the aspartate is to attack the C6 position of the uridine rather than 
stabilise the bound RNA [39] . 

Truncated hPuslp does have slight differences in affinity with 
H7 SRA substrates compared with the FL hPuslp (Table 2). 
Dissociation constants measured between FL hPuslp and the 
various H7 sequences are either equal or two-fold lower than the 
ones seen with AhPuslp, revealing that the truncated N- and C- 
terminal sequences are not essential for the stable association with 
the H7 RNA constructs (Figure IB and Table 2). In detail, 
truncation 1 corresponds to the removal of distant sequences from 
the H7 stem loop (Figure 1A). The missing nucleotide sequence 
does not prevent stable complex formation, clearly indicating that 
hPuslp association seems to be restricted to the region closer to the 
uridine previously identified as the site of modification by hPuslp 
[15]. Therefore, association of hPuslp with H7 SRA is similar to 
its association with tRNA since it was shown previously that 
removing distal parts of the tRNA, such as the acceptor stem, did 
not impair association [21]. To further refine the hPuslp binding 
site on the H7 SRA, we generated the H7 truncations 2, 3, and 4. 
These three sequences are apparently recognised by both hPuslp 
proteins equally (Table 2). Therefore, we conclude that hPuslp 
only binds the nucleotide region surrounding either side of the 
targeted uridine (Figure 1A). This corroborates previous findings 
that RNA sequence immediately upstream or downstream of the 
targeted uridine does not significantly influence the overall affinity 
of hPuslp [21,37]. 
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Figure 1. Human pseudouridine synthase 1 binds to various SRA constructs. (A) Secondary structures and sequences of the SRA constructs 
used in the binding tests with the putative U to ¥ position indicated. (B) Fluorescence anisotropy measurement of SRA binding to different hPusI p 
constructs. Fluorescein-conjugated SRA sequences were incubated with increasing amounts of the indicated proteins (full length hPusI, filled circle; 
AhPusIp, filled square; AhPusIp D146A, filled triangle). Normalized anisotropy values from 2-3 experiments were plotted against protein 
concentrations to measure binding affinities. 
doi:1 0.1 371 /journal.pone.009461 0.gOOl 



Pseudouridine synthase assays of hPusIp enzymes tRNA 
and HP7 SRA RNA substrates 

The activities of the three hPuslp constructs (FL hPuslp, 
AhPusIp, and D146A AhPusIp) on the same H7 SRA substrates 
were assayed by the H' 5 -release method (Table 3; [31,32]). For 
these experiments, we used mouse mitochondrial tRNA Asp as a 
positive control for hPuslp activity [22]. The FL hPuslp is highly 



active towards tRNA Asp , with significant levels of pseudouridine 
formation: while the AhPusIp has lower, but reproducible activity 
clearly indicating that the deleted N- and C-termini are not 
essential for either the RNA binding or the activity of the hPus 1 p 
enzyme (Tables 2 and 3). As expected, no activity could be 
measured with D146A AhPusIp with the catalytic aspartate 
residue 146 mutated (Table 3). The level of pseudouridine 
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Table 2. Affinity constants between hPusIp and the various H7 SRA substrates. 



Protein/RNA Complexes 


Kd InM) 


SD* 


n b 


R 2 


hPus1p/H7 SRA FL 


219 


41 


1 


0.994 


hPus1p/H7 SRA truncation 1 


103 


26 


1 


0.993 


hPus1p/H7 SRA truncation 2 


111 


24 


0.8 


0.994 


hPus1p/H7 SRA truncation 3 


122 


45 


1 


0.982 


hPus1p/H7 SRA truncation 4 


96 


49 


0.7 


0.95 


AhPus1p/H7 SRA FL 


484 


73 


1 


0.993 


AhPus1p/H7 SRA truncation 1 


219 


33 


0.8 


0.994 


AhPus1p/H7 SRA truncation 2 


94 


14 


1 


0.989 


AhPus1p/H7 SRA truncation 3 


95 


21 


0.7 


0.992 


AhPus1p/H7 SRA truncation 4 


74 


13 


0.8 


0.998 


D146A AhPuslp/H7 SRA FL 


2200 


44 


1 


0.991 


D146A AhPus1p/H7 SRA truncation 2 


3900 


65 


1 


0.995 



a Listed are the standard deviation (SD) of two distinct binding experiments. 
b n represents the number of binding sites between the protein and the SRA fragment. 
R 2 represents the coefficient of determination calculated for each fitting curve. 
doi:1 0.1 371 /journal.pone.009461 0.t002 



formation by the FL enzyme observed with several of the H7 SRA 
substrates is significant, although lower than for the tRNA 
(Table 3). It seems likely that this lower level of activity is due to 
the atypical H7 secondary structure. Indeed, the H7 RNA is 
composed of multiple stem loops but does not contain any four- 
way junctions as found in tRNAs. Recent studies of the hPuslp 
enzymatic activity using tRNA-derived substrates have shown the 
importance of having a stem loop 3' to the modified uridine [21]. 
This requirement does not seem to be critical with the SRA 
modification (Figure 1A). The H7 SRA full-length substrate 
cannot be modified significantly by any of the hPuslp constructs. 
This observation may indicate that FL H7 SRA is not folded as 
predicted, possibly due to the absence of the remaining SRA 
sequence. Similar observations have been made for other large 
RNA molecules, such as group I introns, which form peripheral 
interactions essential for their functionality or recognition [40]. 
Truncation 1 is modified in a similar way by either FL or 
AhPuslp, which clearly indicates that the region between 
nucleotide 16 and 39 is not essential for both the association 
and the modification of the SRA by hPuslp (Tables 2 and 3). 
However, shortening the SRA construct further, from either side 
of the targeted uridine (truncation 2, 3, and 4), reduces the activity 
to background levels, although hPuslp can still form a stable 
complex with these sequences (Tables 2 and 3). These activity 
experiments suggest the importance of stem stability for protein- 
RNA recognition, and indicate that hPuslp requires stable stem 
structures on both sides of the previously identified uridine 236 of 
the SRA (Figure 1A and Table 3; [15]). These data complete 
previous observations regarding tRNA modification by hPuslp 
[21]. In this case, the minimal substrate required a stem 3' to the 
modified uridine [21]. It is now clearly established that hPuslp has 
a relaxed sequence specificity, and that stems surrounding the 
targeted uridine play a non-essential role in the efficiency of 
pseudouridylation [13,21,41]. 

Structural insights into the catalytic domain of the 
human Puslp enzyme 

The crystal structures of the AhPus 1 p and the D 1 46A AhPus 1 p 
enzymes have been determined in complex with several molecules 
present in the crystallisation conditions. Crystals of AhPuslp 



belong to space group P2i2[2i, with two molecules per 
asymmetric unit. The structure was refined to 2.0 A resolution, 
with an R„ ork of 18.4% and an R [lee _ of 22.4% (Table 1; Figure 2A). 
Crystals of hPuslp D146A belong to space group P22i2i, and 
contain one molecule per asymmetric unit. The structure was 
refined to 2.7 A resolution (R„ ork of 18.3% and R [ree of 23.8%; 
Figure S3). The two hPuslp structures are essentially identical 
(Figure 2A and Figure S3). In both crystal structures, electron 
density for residues corresponding to the so-called forefinger loop 
(region 100-110) is absent, and so these residues have been 
omitted from the final structure (Figure 2B). A similar situation 
was reported recently by Czudnochowski and colleagues [28] . The 
high flexibility of this loop may be due to the absence of an RNA 
substrate, as this region is contacted by the tRNA in the TruB/ 
tRNA complex structure [10,17]. The region preceding the C- 
terminal helices of hPuslp (residues 348-352) is described by weak 
electron density, and so this region was built in molecule B of the 
AhPuslp structure only (Figure 2B). 

The atomic model shows the typical fold for a pseudouridine 
synthase from the TruA family (Figure 2A and B; [6,28,41]). 
However, the AhPuslp enzyme has the peculiarity of having a C- 
terminal extension of approximately 35 residues, which folds on 
the face opposite to the active site cavity (Figure 2B). This 
extension covers residues involved in the dimerization of the 
bacterial TruA enzyme [6,34,41]. If we superimpose our structure 
of the AhPuslp enzyme onto the bacterial TruA dimer, it clearly 
shows that the mammalian enzyme cannot form a dimer (Figure 
S4). Therefore, this structure provides further evidence that the 
mammalian Puslp enzyme functions as a monomer (Figure 2B). 
Conservation analysis, using the ConSurf server [42] fed with a 
multiple alignment done by the program PipeAlign [43], shows 
that the catalytic core is highly conserved (Figure 2C). Similarly, 
plotting temperature factors of the atomic model also indicates 
that the active site as the most stable region of the protein 
(Figure 2C). Furthermore, we observe that the most conserved and 
stable regions of the AhPuslp structure are formed by the loops 
surrounding the catalytic site, suggesting that this cavity exhibits 
plasticity, possibly to accommodate the various RNA substrates 
(Figure 2C). We also notice that the additional C-terminal helix, 
0t8, is poorly conserved at the sequence level, although all 
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Table 3. Pseudouridine incorporation measured with various hPusIp enzymes. 







RNA substrates 










Enzyme 3 


tRNA Asp 


H7 SRA full length 


H7 SRA truncation 1 


H7 SRA truncation 2 


H7 SRA truncation 3 


H7 SRA truncation 4 


FL hPusIp 


0.46 (0.01) 


0.02 (<0.01) 


0.12 (<0.01) 


0.01 (<0.01) 


0.05 (<0.01) 


0.02 (<0.01) 


AhPusIp 


0.08 (0.01) 


0.00 (<0.01) 


0.10 (0.01) 


0.01 (<0.01) 


0.01 (0.02) 


0.02 (0.02) 


D146A AhPusIp 


0.01 (-C0.01) 


0.00 (<0.01) 


0.00 (<0.01) 


ND 


0.00 (<0.01) 


ND 



a The activities are in moles \|f/moles RNA with the mean of three separate assays and standard deviation (SD). With all of the substrates the maximum activity should be 
— 1.0 moles \|*/moles RNA. Background levels of activity seen with Lac (fj-galactosidase) have been subtracted. ND indicates that the combination of D146A AhPusIp 
D146A and the corresponding RNA substrate was not determined. 
doi:1 0.1 371 /journal.pone.009461 0.t003 



eukaryotic pseudouridine synthase sequences have a predicted 
helical extension at this location. It is likely that such conservation 
of a secondary structural element reflects the functional impor- 
tance of the helix. One possible role would be to stabilise the two 
N- and C-terminal sub-domains of hPuslp (Figure 2B). The helix 
may make hPuslp more rigid so that it behaves as a platform for 
RNA modification with only minor active site adaptations to the 
incoming RNA substrates. 

Differences between structures of AhPusIp and AhPusIp 
D146A are limited to external, poorly conserved loops, which 
confirm the flexibility of these regions previously identified by 
Czudnochowski et al. (Figure 2C; Figure S3; [28]). Extensive co- 
crystallisation assays with small SRA substrates or uridine 
derivatives resulted in diffracting crystals although the correspond- 
ing electron density maps did not show any extra density peaks 
near the active site of AhPusIp (results not shown). However, in 
the presence of the amino acid mixture from the crystallisation 
conditions, the F ohs -F calc and the 2F ohs -F l ak: Fourier electron 
density maps systematically showed strong connected electron 
density near the active site residues D146, Y201, H292, and R295 
(Figure 3). These densities were fitted with the various compounds 
present in the crystallisation conditions, which allowed us to 
identify unambiguously the atypical molecules bound in the active 
site (Figure 3 and Figure S5). Three different molecules were 
bound: i) a lysine in molecule A of the AhPusIp crystal, ii) a 
glutamate in molecule B of the AhPusIp crystal, iii) a HEPES 
molecule in the D146A AhPusIp crystal (Figure S6). 

In molecule A, the carboxyl group of the lysine molecule forms 
a hydrogen bond with the backbone atoms of residue D146, while 
the lysine amide group is bound to the hydroxyl group of residue 
Y201 (Figure 3A). In molecule B of the same crystal form, the 
backbone of the glutamate compound is hydrogen bonded to the 
hydroxyl group of residue Y201, while the main chain and the side 
chain of residue R295 stabilise the OE1 and OE2 positions of the 
glutamate (Figure 3B). The various interactions between these two 
compounds and the backbone atoms of hPuslp are reminiscent to 
the one established between the pseudouridine nucleotide and the 
TruB protein [10,17]. The crystal structures presented here 
indicate that the active site of AhPusIp can accommodate several 
types of molecule through the establishment of multiple hydrogen 
bonds. The carboxyl groups of the two amino acids are located 
near the backbone of residue D 1 46 and occupy roughly the same 
position in both molecules of AhPusIp (Figure 3 A and 3B). 
However, the glutamate and the lysine side chains are orientated 
in opposite directions. The side chain of the glutamate points 
towards residue R295 of a5, while the lysine NZ atom faces 
residue R144 of the catalytic loop. None of these bound molecules 
form hydrogen bonds with the catalytic aspartic acid D146 side 
chain. The presence of these two compounds with opposite ionic 



characteristics, i.e. positively or negatively charged, suggests that 
the active site cavity of hPuslp may mimic a dipole. We speculate 
that such dipolar properties could facilitate the necessary rotation 
of the pyrimidine ring during catalysis. 

A HEPES molecule is bound in the active site of the D 1 46A 
mutant structure (Figure 3C) and is located at the same position as 
the MES compound found in the crystal structure of Puslp solved 
recently (PDB id 4ITS; [28]). The HEPES molecule is bound in 
the active site, and its sulphate group occupies exactly the same 
orientation as the sulphate from the MES molecule (Figure 3C and 
Figure S6). Similarly, N4 of the MES and Nl of the HEPES 
molecule are nearly indistinguishable (Figure 3C and Figure S6). A 
HEPES molecule is bound to the active site in the structure of 
pseudouridine synthase 10, although its orientation was different 
(Figure S6). The NZ atom of the bound lysine is also located 
within the same area, which could indicate that AhPusIp has a 
preferential site for nitrogen stabilisation near the residue D146 - 
the residue initiating the pseudouridine reaction (Figure 3C). In a 
superposition of the Leucyl tRNA-bound TruA (PDB id 2NR0; 
[34]) and the AhPusIp D146A structures, the N9 atom of the 
flipped guanosine is nearly co-incident with the HEPES molecule 
(Figure S6), reinforcing the proposed electropositive region near 
the active site residue D146. The piperazine moiety of the HEPES 
compound superimposes with the ribose ring of the guanosine 
(Figure S6). The positioning of a sulphate group inside the active 
site of AhPusIp may be irrelevant for the enzymatic reaction /w se, 
but clearly indicates that the active site cavity is not neutral, a 
property that was suggested to be important during the catalytic 
cycle (Figure 3 A and 3B). 

Conclusion 

The structures presented here further demonstrate the unusual 
capacity of hPuslp to modify various types of RNA substrates. 
These atomic models also indicate that hPuslp functions as a 
monomer in contrast to its bacterial counterpart, TruA. We 
suggest that the large C-terminal extension, which folds into 
several ot-helices, prevents the dimerization of hPuslp. This 
monomeric behaviour may well explain the diversity of substrates 
that are recognised by the enzyme, since we show that a substrate 
with a large asymmetric loop, preceded and followed by stable 
stems, may be targeted by hPuslp. Our crystallographic models 
contain various small molecules bound to the hPuslp active site, 
which, we believe, may reflect its unexpected electronic properties. 
Further structural characterisation of hPuslp complexed with 
SRA will be necessary to compare the specific determinants 
behind the modification of this ncRNA. 
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Variable Conserved Stable Flexible 



Figure 2. Overall views of the crystal structure of AhPuslp. (A) The AhPusIp monomer is shown in a cartoon representation and coloured 
according to secondary structure elements (red, a-helices; yellow, p-sheet; and green, loops). The C-terminal helices, which are specific to the hPusI p 
protein, are shown in blue. (B) Domain organization of AhPusI p. The N-terminal and C-terminal domains are orange and green, respectively; specific 
C-terminal helices are blue. The two domains form a cavity, which contains the active site. Two functionally important loops, the Forefinger and the 
Thumb loop are labelled. The catalytic amino acid residues are shown as sticks and coloured according to atom type (carbon, nitrogen, oxygen, and 
sulphur are yellow, blue, red, and gold, respectively) (C) Surface conservation and temperature factor representation of hPuslp. Conservation scores 
were determined with the ConSurf server ([42]), and plotted on the surface of the AhPuslp structure (blue, variable; red, conserved,). The 
temperature factors of the refined model were plotted by the same method. Low B factors (stable) are blue and high B factors (flexible) are red. A 
dashed circle (yellow or red) indicates the location of the enzyme active site. 
doi:1 0.1 371 /journal.pone.009461 0.g002 
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A B C 




Figure 3. Close view of the active site residues from the AhPusIp and the AhPusIp D146A crystal structures. (A) A D-lysine molecule 
from the crystallisation condition is bound within the active site of molecule A of the AhPusI p enzyme. (B) A D-glutamate molecule is bound in the 
active site of molecule B of the same crystal structure. (C) A HEPES molecule is found in the active site of the inactive AhPusIp D146A crystal 
structure. Amino acids in the active site are shown and coloured as in Figure 2B. Bound molecules are shown as sticks and coloured according to 
atom type (carbon, nitrogen, oxygen, and sulphur are magenta, blue, red, and gold, respectively). Water molecules are shown as red spheres. 
Hydrogen bonds between the amino acids and the protein atoms are indicated as black dash lines. 
doi:1 0.1 371 /journal.pone.009461 0.g003 



Supporting Information 

Figure SI Size exclusion chromatography of the puri- 
fied WT and D146A hPuslp. Following their expression and 
purification, both proteins were loaded on a superdex200 gel 
filtration column. The WT and the D146A hPuslp are shown as a 
blue and a red trace respectively. 
(TIF) 

Figure S2 Denaturing gel analysis showing the absence 
of covalent 5-FU-RNA/hPuslp complex being formed 
during our binding experiments. 5-FU modified RNA 
truncation 1 (at 36 uM) and the hPuslp (at 20 uM) were 
incubated for 3 h at the indicated temperature. Denaturing buffer 
was then added and the samples were either directly loaded or 
incubated for 5 min at 95°C before loading in a 12% SDS-PAGE. 
(TIF) 

Figure S3 Overall view of the AhPusIp atomic models. 

(A) Structure of the D146A AhPusIp. The protein is shown as a 
cartoon and colored as follow: orange, N-terminal domain; green, 
C-terminal domain; blue, for the C-terminal helices. Amino acids 
important for the catalytic cycle are shown as sticks and coloured 
according to atom type (carbon, yellow; nitrogen, blue; oxygen, 
red). (B) The structures of the two AhPusIp monomers and the 
D146A AhPusIp monomer are superimposed and coloured in 
orange, blue and white respectively. 
(TIF) 

Figure S4 Dimerization of the Pusl enzymes. (A) The 

dimer of the bacterial TruA enzyme is shown as observed in the 
crystal structure of the E. coli TruA enzyme (PDB code 2NR0; 
[34]). (B) Our atomic model of the catalytic domain of the hPuslp 
has been superimposed onto each TruA molecule (coloured in 
orange and green). The C-terminal helices are clearly clashing 
with each other and therefore would prevent such multimerisation 
(coloured in cyan and blue). Protein structures are shown as 
cartoon in both panels. 
(TIF) 



Figure S5 Difference Fourier electron density maps 
found in the active sites of each AhPusIp molecule. The 

left column shows the F 0 -F c electron density map contoured at 2.7 
a for the molecule A. The right column shows the difference 
electron density map found in the molecule B contoured at the 
same level. Individual amino acids present in the crystallization 
buffer were fitted into the densities. Each compound is shown as 
stick colored according to atom type (carbon, gray or purple; 
nitrogen, blue; oxygen, red). The chosen amino acid for each 
AhPusIp molecule is colored in purple. The difference electron 
density map was calculated using the protein only as a model and 
are shown as green mesh. 
(TIF) 

Figure S6 Superposition of the molecules found in the 
active site of various Pseudouridine Synthase 1 struc- 
tures. (A) Superposition of the D146A AhPusIp atomic model 
bound to an HEPES molecule with the catalytic domain of the 
hPus 1 p structure bound to a MES compound (shown as sticks and 
colored in blue for the protein and white for the HEPES; [28]). (B) 
Superposition of the D146A AhPusIp atomic model with the 
structure of the human PUS 1 0 enzyme also bound to an HEPES 
compound (shown as stick and colored in orange for the protein 
and white for the HEPES; PDB code 2V9K; [5]). Amino acids 
from the PUS 10 enzyme are labelled in italic. The HEPES has an 
opposite orientation with the phosphate outside the active site of 
PUS 10. (C) Superposition of the D146A AhPusIp atomic model 
with the structure of the E. coli TruA enzyme in complex with 
tRNA leu (shown as stick and colored in green for the protein and 
white for the guanosine; PDB code 2NR0; [34]). 
(TIF) 
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